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METHOD, APPARATUS, AND SYSTEM FOR EFFICIENT RATE CONTROL IN 

AUDIO ENCODING 



5 FIELD OF THE INVENTION 

The present invention relates to the field of signal 
processing. More specifically, the present invention 
relates to a method, apparatus, and system for efficient 
rate control in audio encoding. 

10 

BACKGROUND OF THE INVENTION 

As technology continues to advance and the demand for 
video and audio signal processing continues to increase at 
a rapid rate, effective and efficient techniques for signal 

15 processing and data transmission have become more and more 
important in system design and implementation. Various 
standards or specifications for audio signal processing 
have been developed over the years to standardize and 
facilitate various coding schemes relating to audio signal 

20 processing. In particular, a group known as the Moving 

Pictures Expert Group (MPEG) was established to develop a 
" standard or specification for the coded representation of 
moving pictures and associated audio stored on digital 
storage media. As a result, a standard known as the 
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ISO/IEC 11172-3 (Part 3 - Audio) CODING OF MOVING PICTURES 
AND ASSOCIATED AUDIO FOR DIGITAL STORAGE MEDIA AT UP TO 
ABOUT 1.5 MBITS/S (also referred to as the MPEG standard or 
MPEG specification herein), published August, 1993, was 
5 developed which standardizes various coding schemes for 
audio signals, e.g., MPEG-1 or MPEG-2 Layers I, II, and 
III. ISO stands for International Organization for 
Standardization and IEC stands for International 
Electrotechnical Commission, respectively. Generally, the 

10 MPEG audio specification does not standardize the encoder 

but rather the type of information that an encoder needs to 
produce and write to an MPEG compliant bitstream, as well 
as the way in which the decoder needs to parse, decompress, 
and resynthesize this information to regain the encoded 

15 audio signals. In particular, MPEG standard is developed 
for perceptual audio coding rather than lossless coding. 
In lossless coding, redundancy in the waveform is reduced 
to compress the sound signal and the decoded sound wave 
does not differ from the original sound wave. In contrast, 

20 in perceptual audio coding, the aim is not to regain the 
original signal exactly after encoding and decoding but 
rather to eliminate those parts of the audio signal that 
are irrelevant to the human ear (e.g., that are not heard). 
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An audio encoder typically includes a bit allocation 
module or unit (also called the bit allocator herein) whose 
role is to allocate more bits to those frequencies where 
quantization noise is audible to a listener and allocate 
5 fewer bits to those frequencies where quantization noise is 
masked and is inaudible to the listener. Also, the bit 
allocator needs to ensure that the total number of bits 
used for a specific audio block or frame does not exceed 
the maximum number of bits available as determined by the 

10 specified output bit rate. Currently, the methods for 
performing the bit allocation, as described in the MPEG 
standard includes two processing loops: (1) an outer or 
distortion control loop; and (2) an inner or rate control 
loop. One of the problems or disadvantages associated with 

15 the current methods described in the ISO/IEC 11272-3 MPEG 
standard is their inefficiency due to numerous iterations 
involved in determining or computing the optimum 
quantization parameters that will satisfy the rate 
criteria . 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The features of the present invention will be more 
fully understood by reference to the accompanying drawings, 
in which: 

5 Figure 1 is a block diagram of one embodiment of an 

encoder in which the teachings of the present invention may 
be implemented; 

Figure 2 is a flow diagram illustrating an inner or 
. rate control loop of a bit allocation method according to 
10 the current ISO/IEC specification; 

Figure 3 shows a flow diagram illustrating an outer or 
distortion control loop of a bit allocation method 
according to the current ISO/IEC specification; 

Figures 4,5, and 6 illustrate examples of the 
15 progression from an initial global gain value to a final 

global gain value, in accordance with one embodiment of the 
present invention; 

Figure 7 shows an example of a curve where the 
estimation of the global_gain leads to a value of the 
20 total_bits that is below but not close to the target_bits; 

Figure 8 shows a flow diagram of one embodiment of a 
fate control process according to the teaching of the 
present invention; and 

4 
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Figure 9 shows a flow diagram of a process in 
accordance with one embodiment of the present invention. 
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DETAILED DESCRIPTION 

In the following detailed description numerous 
specific details are set forth in order to provide a 
thorough understanding of the present invention. However, 
5 it will be appreciated by one skilled in the art that the 
present invention may be understood and practiced without 
these specific details. Furthermore, while the teachings 
of the present invention are applicable to MPEG Layer III 
(commonly known as MP3) audio encoding, it should be 

10 appreciated and understood by one skilled in the art that 
the present invention is not limited to MPEG Layer III 
audio encoding and can be applied to any method, 
apparatus, and system for efficient bit allocation to 
accomplish bit rate reduction in audio processing. 

15 Figure 1 is a block diagram of one embodiment of an 

encoder 100 in which the teachings of the present invention 
may be implemented. In one embodiment, the audio encoder 
100 may include a filter bank structure or unit 110, a 
psycho-acoustic model (PAM) 120, a bit allocator and 

20 quantizer 13 0, a Huffman encoder 14 0, and a bit stream 

formatter 150. In one embodiment, input audio samples such 
" as pulse code modulation (PCM) samples are fed into the 
filter bank unit 110 and transformed using a filter bank to 
generate output sub-band samples. In MP3 audio encoding, 
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the output sub-band samples can be further processed using 
a Modified Discrete Cosine Transform (MDCT) to obtain 
higher frequency resolution. The input PCM samples are also 
input to the Psycho-Acoustic model 12 0, which independently 
5 analyzes the input data and models human auditory 

perception. The psycho-acoustic model 120 is designed and 
configured to determine the ear sensitivity to noise in the 
frequency domain. In one embodiment, the output from the 
psycho-acoustic model 120 is a frequency mask that 

10 describes the maximum allowed quantization noise in each of 
the bands. Both the MDCT output spectrum and the frequency 
mask are then input into the bit allocator and quantizer 
130. The function of the bit allocator (also called bit 
allocation module herein) in block 13 0 is to allocate more 

15 bits to those frequencies where quantization noise is 
audible to the listener and allocate fewer bits to 
frequencies where quantization noise is masked by program 
material and is inaudible to the listener. Furthermore, the 
bit allocator needs to ensure that the total number of bits 

20 used for a specific PCM block (or frame) does not exceed 
the maximum number of bits available as determined by the 
" specified output bit rate. The output generated from the 
bit allocator and quantizer 130 is then input into the 
Huffman encoder 140. The bitstream formatter 150 is 
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configured to generate output encoded audio frames based on 
the data received from the Huffman encoder 140. 

Figure 2 is a flow diagram illustrating an inner or 
rate control loop of a bit allocation method according to 
5 the current ISO/IEC specification. Generally, the rate 
control loop is responsible for selecting a glohaljgain 
value (also called the quantizer step size value herein) to 
insert in the following quantization formula: 



10 



ix(i) = nint 



K(i)l 

global _ gain 



0.0946 



(1) 



where ix corresponds to the quantized spectral values for 
frequency line i, and xr corresponds to the original 
spectral value. Since the quantized values will be further 

15 encoded using Huffman tables, the glohaljgain parameter 

first is adjusted so that the maximum quantized value falls 
below the maximum limit of the corresponding Huffman look- 
up tables described in ISO/IEC specification. This is done 
according to the ISO/IEC spec by continuously increasing 

20 the global_gain value until the maximum quantized value is 
less or equal to the maximum Huffman lookup table (LUT) 
index (e.g. 8191 for MP3 encoding). After selecting the 
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minimum global_gain to allow Huffman table look-up, the 
next task is to ensure that the number of bits used for 
Huffman encoding does not exceed the maximum number of bits 
allocated for the block of spectral values. This is done 
5 according to the ISO/IEC spec by continuously increasing 
the global_gain value until the number of bits used for 
encoding is equal or less than the maximum number of bits 
allocated for the block. As shown in Figure 2, at block 
210, the global_gain value is initially set to zero or to 

10 some initial estimate. At block 215, the spectral values 
are quantized. At decision block 220, if the maximum 
quantized spectral value is within the corresponding 
Huffman table limit, then the process continues to block 
225, otherwise the process proceeds to block 230. At block 

15 230, the. value of the global_gain is increased (e.g., 

incremented by 1) and the process loops back to block 215. 
At block 225, a number of bits used for Huffman encoding is 
determined. At decision block 235, if the number of bits 
used for Huffman encoding exceeds the maximum number of 

20 bits allocated for the block of spectral values, then the 
process proceeds to block 24 0 to increase the value of the 
global_gain (e.g., increment the value of the global_gain 
by 1), otherwise the process proceeds to end at block 290. 
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At block 24 5, the spectral values are quantized. The 
process then loops back from block 245 to block 225. 

Figure 3 shows a flow diagram illustrating an outer or 
distortion control loop of a bit allocation method 
5 according to the current ISO/IEC specification. Generally, 
after determining a global_gain value to meet the rate 
criteria as described above, the outer or distortion 
control loop computes the amount of distortion introduced 
by the quantization. This is accomplished by decoding the 

10 quantized value and finding the mean-squared error (MSE) , 
or some other distortion measure, between the decoded 
spectral value and the original spectral value within each 
scalefactor band (group of frequency lines) . Scalefactor 
bands not meeting the distortion criteria are amplified by 

15 some prescribed factor and the rate control loop is called 
iteratively with the new amplified spectral values, until 
the distortion criteria is met for all the bands. As shown 
in Figure 3, at block 310 the rate control loop as 
described in Figure 2 is called to determine a global_gain 

20 value. At block 315, for each scalefactor band, the 

process proceeds as follows. At block 320, the distortion 
for the respective band is calculated. At decision block 
325, if the distortion calculated does not meet the 
distortion criteria (e.g., the. distortion calculated is not 
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less than the maximum distortion allowed) then the process 
proceeds to block 33 0 to amplify the respective band by a 
predetermined factor. At decision block 335, if the 
distortion criteria is met for all the bands (e.g., no 
5 distorted bands) , then the process proceeds to end at block 
390. Otherwise the process loops back to block 310. 

As mentioned above, a disadvantage associated with the 
methods disclosed in the ISO/IEC document is their 
inefficiency due to the numerous iterations involved in 
10 computing the glohal_gain value to satisfy the rate 

criteria.. As described in more details below, according 
to the teachings of the present invention, a new method is 
provided for efficient bit allocation of spectral values 
obtained from a sub-band filter. In one embodiment of the 
15 present invention, the method as described herein is 

directed to improving the efficiency of the rate control 
loop (also called rate control process herein) . The method 
as described herein includes the following: 

Deriving a closed form equation to determine the 
20 global_gain to meet the maximum Huffman look-up limit; and 
° Using a modified Newtonian search to determine the 
' global _gain required to meet the rate criteria. 

Accordingly, at a high level, the present invention 
includes two parts or two components as follows: (1) 
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efficient determination of a minimum , global_gain value to 
meet the maximum Huffman look-up criteria; and (2) 
efficient determination of a global_gain value to meet 
the rate criteria within the rate control loop. 



D etermining the minimum global gain val ue to meet the 
maximum Huffman look-up criteria 

Huffman tables that are used in a typical audio 
encoder are limited to a maximum quantized value that can 

10 be looked up using the table index. For example, Huffman 
tables that are used in a typical MP3 encoder are limited 
to a maximum quantized value of 8191 that corresponds to 13 
bits of precision (2 13 entries) . Therefore, the maximum 
quantized value for the block of spectral values needs to 

15 be bounded to the maximum index into the corresponding 
Huffman tables. For illustration and generalization 
purposes, the maximum quantized value is called a. In the 
case of MP3 encoding, a = 8191. Equation (2) below can be 
obtained using equation (1) shown above: 



20 ix(i) = nint 



7 . y /4 



x 



global _ gain 

2- 4 



+ 0.0946 



£<x (2) 



Removing the nint [] function (standing for nearest 
integer), the following equation (3) can be obtained: 
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10 



15 



' x,(0| 



global _ gain 



K2 



3/4 



+ 0.0946 + £<oc (3) 



where s is the error introduced by quantizing to the 
nearest integer, and therefore: 

|s|<0.5 (4) 

In one embodiment, using =0.5 and setting 
|x r (i)| = MAX|x r (i)| will result in the largest value for the left 
hand side of equation (3), where MAX|x r (i)| represents the 
largest spectral value magnitude across the frequency lines 
indexed by i. Therefore, equation (3) can be re-written as: 



. ^3/4 

MAXlx r (i)| 

global _ gain 

2 4 



+ 0.0946 + 0.5 < a (5) 



The following equations (6) - (10) are used to solve 
equation (5) for the variable global_gain. Equation (5) can 
be rewritten as follows: 



r \ 3M 
MAX|x T (i)| 

global gain 



< a -0.5946 



(6) 



^ 2 4 

Taking the 4/3 root on both sides of equation (6) , 
equations (7) is obtained as shown below: 



MAX l Xr(i) '<[g-0.5946r 3 (7) 

elobal gain L 



global _ gain 

2 4 
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10 



Solving for 2 3lobal - 9ain/4 results in the following 



equation : 



giobal.gain MAX|x r (i)| 
) 4 > 1 1 1 



(8) 



[a-0.5946] 4 ' 3 

Taking the logarithm base 2 of both sides of equation 
(7) , the following equation is obtained: 



global, gain > ^ 
4 



MAX|x r (i)| 



(9) 



Ja-0.5946] 4/3 J 

Solving for global_gain results in equation (10) shown 
below: 

MAX|x r (i)| 



global _ gain > 4 • log 2 



(10) 



v [a-0.5946] 4/3 

Since global_gain needs to be an integer number, take 
the ceiling of equation (10) to obtain the following 
equation : 

f MAX|x r (i)| 



global _ gain > 



4 1og 2 



(ID 



[a-0.5946] 4/: 

where |~x~| corresponds to the nearest integer that is 
15 greater than or equal to x. Therefore, the minimum 

global_gain value required to meet the maximum Huffman 
table entry a, can be computed from equation (11) . 



Efficient determination of a global gain value to meet 
20 rate criteria 
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In one embodiment of the present invention, a modified 
Newtonian search process or algorithm is developed as 
described in more details below to find the roots of the 
following equation: 
5 total_bits = f Huffina „(ix)= f Hllffilia n(global_gain) < t arg et_bits (12) 

where f Huffman (• ) corresponds to the total number of bits used 
during Huffman encoding of* the quantized values ix, which 
as shown in equation (12) is a function of global_gain. The 
value target_bits correspond the maximum number of bits to 

10 be encoded per audio frame. In one embodiment, this value 
is dependent on a desired compression ratio or output bit 
rate and the input audio frame. For example, in MP3 
encoding, the input audio frames include 1152 PCM samples 
per channel. If the input sampling rate of the audio signal 

15 is 44.1KHz (or 44100 samples/sec), and the encoding is to 
be done at 128 Kbits/sec, then the target_bits for one 
channel of an audio frame can be computed as follows: 

tareet bits = 128000bits/ S ec-1152sample S _ < bitsusedforM p 3header > 
5 - 44 100 samples /sec 

In general, a Newtonian search process works by 
20 calculating the line tangent to an -unknown" 'surface and 
• using the intercept of this line as a new guess for the 
root of the surface or function. 
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Figures 4,5, and 6 illustrate examples of a 
progression from an initial global_gain value, ggO, towards 
a final global_gain, gg4 , that satisfies the condition in 
equation (12), according to the teachings of the present 
5 invention. In one embodiment, linear convergence faster 

than the ISO/IEC method or ISO/IEC algorithm is achieved by 
using the x intercept to determine a new global_gain, which 
yields a bit allocation value closer to target_bits . 

Generally, the Newton search algorithm or process is a 
10 special case of a class of root finding techniques based on 
Nth-order polynomials. Specifically, the Newton search 
corresponds to a 1 st order polynomial. This root finding 
technique derives from the Taylor Series of a function f (x) 
at some 5 interval from x as follows: 

15 f(x + 6) = f(x) + f(x)5 + f"(x)y + ... + f n (x)^- + ... (13) 

where f n (x) corresponds to the n th derivative of function 
f (x) . 

For relatively smooth functions, derivatives of 2 nd 
order and above may be negligible, and therefore, f (x+8) may 
20 be approximated by: 

f(x + S)«f(x) + f(x)8 ( 14 > 
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In trying to find the value of x for which the 

function is equal to some value c, set f (x+8) = c, and 

obtain the following: 

c-f(x) (15) 
f(x) 

Equation (15) corresponds to the Newton approximation. 
For the bit allocation problem as described herein, x is 
substituted with the global_gain; f (x) is substituted with 
the total Huffman bits, f Huffman (global_gain) ; c is the 
desired root, in this case target_bits; and 6 corresponds to 
the step size to be used to obtain a new global_gain . For 
clarity purposes, the f (global_gain) is used to represent 
fHuffman(global_gain) from now on. Therefore, equation (15) 
becomes : 

t arg et _ bits - f (global _ gain) (16) 
Sgiobai.gain * f (global _ gain) 

The derivative, f ' (global_gain) , at iteration i, can 

be numerically approximated as follows: 

. x f (global gain . ) - f (global _ gain M ) , . 

f (global.ganO * global _ gairli _ gIob ai_g a in H 

The estimation of the function's derivative uses the 
previously computed global_gain. This estimation of the 
derivative is sometimes called in literature as the Secant 
method for finding roots. Generally, this technique is 
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simple and works well with well-behaved functions as in the 
case of Huffman tables. However, it should be understood 
and appreciated by one skilled in the art that any 
derivative estimation technique can be used in accordance 
5 with the teachings of the present invention. 

In one embodiment, the assumption in the use of a 1 st 
order polynomial is that the function to be searched is 
relatively smooth and its derivative is close to a straight 
line. For example, the Huffman tables used for MPEG 

10 encoding are designed so that the total number of bits 
decreases progressively towards 0 as the global_gain is 
increased. Therefore, this implies that the function 
f <global_gain) is well behaved, and a 1 st order polynomial 
will suffice. In one embodiment, the straight line for the 

15 derivative is then used to estimate a new global_gain, 
i.e., global_gain n+ i . 

Two issues may arise when using a Newtonian search 
with equation (12) : 

o First, a large step size in the global_gain value will 
20 cause the algorithm to converge rapidly. However, the 

global_gain estimation should be as close as possible to 
the target_bits. Figure 7 shows an example of a curve 
where the estimation of the global_gain leads to a value of 
the total bits that is below the target_bits. However, 
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this is not the closer one to the target_bits, and hence, 
it is non-optimal. 

o Second, since global_gain needs to be an integer value, 
the global_gain value gets truncated to the closer integer 
5 that is less than or equal to the obtained global_gain 
during each iteration. As the search progresses in the 
iterations and gets closer to target_bits, the step size 
for estimating the new global_gain may be less than 1, 
which means that global_gain will not change and therefore 

10 the process would enter a non- convergent cycle. 

In one embodiment of the present invention, the first 
issue was addressed by allowing the search process to back- 
track to a smaller value of global_gain after it reaches a 
global_gain that satisfies the condition in equation (12) . 

15 In one embodiment, this back- tracking can be repeated more 
than once. Then, the global_gain that results in a 
total __bits closer to target_bits is selected. Usually, the 
selection may not be necessary, since the last global_gain 
after N times is the closer one to the target_bits. The 

20 times the process is allowed to reach a total_bits that 
satisfies equation (12) is denominated as "go_up" in the 
flow diagram shown in Figure 8 described below. 

In one embodiment, the second issue was addressed by 
forcing the global__gain during each iteration to be updated 
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by at least a positive integer (e.g., +1) or a negative 
integer (e.g., -1), depending on the direction of the 
search. A positive integer such as +1 is used if the 
process is still progressing down towards target_bits, and 
5 a negative integer such as -1 is used when the process 
reaches a total_bits below target_bits and the search is 
continued. 

In one embodiment of the present invention, the 
global_gain parameter is stored in memory to be used as an 

10 initial estimate for the next block of spectral values. Two 
initial values of total_bits (tb 0 and tbi) computed from two 
initial global_gains (gg 0 and ggx respectively) are used to 
start the iteration. In one embodiment, gg 0 is taken as the 
global_gain pre -computed as described above and ggi can be 

15 computed as follows: 

gg , = max(gg 0 + P, global _ gain from previous block) (18) 
where f3 can be a predetermined positive integer that can be 
optimized to increase the convergence rate. For example, a 
value of 5 for /? can be used. In one embodiment, the 

20 global_gain of the previous block is compared with gg 0 to 
ensure that the criteria of equation (11) is met for gg x . 

Figure 8 shows a flow diagram of one embodiment of a 
rate control process (also called rate control loop) 800 
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according to the teaching of the present invention. At 
block 810, a first initial value of the global_gain 
parameter (e.g., ggO) is computed. In one embodiment, the 
first initial value ggO is computed using equation (11) as 
5 described above. At block .812, a second initial value of 
the global_gain parameter (e.g., ggl) is computed, based on 
equation (18) as described above. At block 814, the 
spectral values are quantized using ggO . At block 816, a 
first initial value for the total_bits parameter is 
10 computed. In one embodiment, the first initial value for 
the total_bits is computed based on the Huffman encoding 
bits for ggO. At decision block 818, if the first initial 
value of the total_bits tbO is below the target_bits value 
then the process proceeds to end at block 890. Otherwise, 
15 the process proceeds to block 820 to quantize the spectral 
values using ggl. At block 822, a second initial value of 
the total_bits is computed. In one embodiment, the second 
initial value of the total_bits is computed using the 
Huffman encoding bits for ggl. At decision block 824, if 
20 the second initial value of the total_bits is below the 
target_bits value then the process proceeds to block 826, 
otherwise the process proceeds to block 828. At block 826, 
increase the number of iterations go_up (e.g., increment 
go_up by 1) and set the direction to back track to a 
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smaller value of global_gain (e.g., direction = -1). At 
block 828, since the current value of the total_bits is not 
below the target_bits value, set the direction to progress 
down towards the target_bits (e.g., direction = 1). The 
5 process then proceeds either from block 826 to block 830 or 
from block 828 to block 832. At block 830, if the maximum 
number iterations is reached (e.g., go_up > max_go_up) , 
then the process proceeds to end at block 890, otherwise 
the process proceeds to block 832. At block 832, two new 

10 initial values of the global_gain parameter are computed 

for another iteration, based on the previous values of the 
global_gain, the previous values of the total_bits, and the 
target_bits value. The process then loops back from block 
832 to block 820 to continue the search for the desired 

15 global_gain value. 

Figure 9 shows a flow diagram of a process in 
accordance with one embodiment of the present invention. 
At block 910, audio samples (e.g., PCM samples) 
representing an input audio signal are received. At block 

20 920, the input audio samples are transformed into a vector 
of spectral values in a frequency domain. At block 930, a 
value of a quantizing parameter that satisfies one or more 
criteria is determined, based at least in part, on a 
modified Newtonian search process. The determined value of 
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the quantizing parameter is used to quantize the respective 
vector of spectral values to generate a vector of quantize 
values . 

As described above, several other root finding 
5 techniques can also be used in place of the Newtonian 

search. The theory behind some of the various techniques is 

discussed below. 

Higher Order Polynomials 

Higher order polynomials may be used to estimate the 
10 root of the function. For an Nth order polynomial, equation 
(13) is truncated after the N th derivative. For example, a 
2 nd order polynomial will correspond to: 

S 2 

f (x + 8) = f (x) + f (x)8 + f "(x) y (1^) 

In order to obtain the value of 6 that will satisfy the 
15 root condition, the following quadratic equation needs to 
be solved: 

C = f(x) + f'(x)8 + f"(x)y (20) 

Also, it is required to estimate the 2 nd derivative of 

the function f (x) . If equation (17) is used to estimate the 

20 2 nd derivative, the following is obtained: 

fM/ , - . . . f (global _ gain ■ ) - f ' (global gain M ) 

f M (global garni )« T7~~; : TT~1 : { } 

V6 ~ 6 1 global _ gamj - g loba l _ gain M 

" which requires storing of the derivative at iteration i-1. 
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The technique of using a 2 nd order polynomial, and 
using equation (21) to estimate the 2 nd derivation of the 
function is commonly known in the art as the Muller's 
method. 

5 initial global gain Estimation 

In one embodiment of the present invention, more than 
one global_gain values are stored in memory for the 
estimation of the initial Newton search conditions. In one 
embodiment, gg 0 is computed according to equation (11) and 

10 ggi is computed according to the following equation: 

( \ 
gg™ =max(gg™+P,c 0 +2c k •global_gain k ,k = m-l,m-2,...,m-Nj (22) 

where m corresponds to the current audio frame under 
iteration and c k are empirically determined coefficients. 
The coefficients c k could be determined by executing a 

15 regression of global_gain in audio frame m against the 

global_gain values from the previous N frames. Any other 
error minimization technique could also be used to estimate 
the global_gain coefficients. 

The invention has been described in conjunction with 

20 the preferred embodiment. It is evident that numerous 

alternatives, modifications, variations and uses will be 
apparent to those skilled in the art in light of the 
foregoing description. 
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